Duration and spectral based stress token generation for HMM speech recognition under stress

نویسندگان

  • Sahar E. Bou-Ghazale
  • John H. L. Hansen
چکیده

I n this paper, we address the problem of isolated word recognition of speech under various stressed speaking conditions. The niain objective is to formulate an alternate training algorithm for hidden Markov model recognition, which better characterizes actual speech production under stressed speaking styles such as slow, loud and Lombard effect, without the need for collecting such stressed speech data. The novel approach is to first construct a previously suggested source generator model of word production employing knowledge of the statistical nature of duration and spectral variation of speech under stress. This is used in turn to produce simulated stressed speech training tokens from neutral tokens, and thus replace neutral da ta used in the recognizer training phase. The token generation training method is shown to improve isolated word recognition by 8% for slow speaking style, 14% for loud speaking style, and 24% for speech under Lombard effect when compared to neutral trained isolated word recognition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress

In this study, a novel approach is proposed for modeling speech parameter variations between neutral and stressed conditions and employed in a technique for stressed speech synthesis and recognition. The proposed method consists of modeling the variations in pitch contour, voiced speech duration, and average spectral structure using hidden Markov models (HMM’s). While HMM’s have traditionally b...

متن کامل

Synthesis of stressed speech from isolated neutral speech using HMM-based models

In this study, a novel approach is proposed for modeling speech parameter variations between neutral and stressed conditions and employed in a technique for stressed speech synthesis. The proposed method consists of modeling the variations in pitch contour, voiced speech duration, and average spectral structure using Hidden Markov Models (HMMs). While HMMs have traditionally been used for recog...

متن کامل

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

High-Order Hidden Markov Model and Application to Continuous Mandarin Digit Recognition

The duration and spectral dynamics of speech signal are modeled as the duration highorder hidden Markov model (DHO-HMM). Both the state transition probability and output observation probabilities depend not only on the current state but also several previous states. Recursive formulas have been derived for the calculation of the log-likelihood score of optimal partial paths. The high-order stat...

متن کامل

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994